DNA coding for protein binds to enhancer of α-fetoprotein gene

ABSTRACT

This invention relates to a DNA coding for a protein that specifically binds to the enhancer of the α-fetoprotein gene and that promotes transcription of that α-fetoprotein gene. This DNA is useful, by applying recombinant DNA technology, for the construction of highly efficient gene expression system for the production of proteins having physiological activities in animal cells.

This application is a continuation of application Ser. No. 07/635,498, filed Dec. 31, 1990, now abandoned, which is based on international application PCT/JP90/00557 filed Apr. 27, 1990.

FIELD OF THE INVENTION

This invention relates to DNA coding for a protein which specifically binds to the enhancer of α-fetoprotein gene and promotes transcription of α-fetroprotein gene. Since the protein is involved in transcription, this DNA can be used for the construction of a highly efficient gene expression system utilizing animal cells.

BACKGROUND OF THE INVENTION

Expression of genetic information in eucaryotic cells is regulated at the levels of transcription into mRNA, mRNA processing, translation to protein and posttranslational processing. The regulation at the transcription level most strongly influences the expression of genetic information. A promoter which controls transcription is found on the 5' side of genes transcribed by RNA polymerase II and sometimes enhancers, which regulate promoter activity were also found. With the recent discovery of nuclear factors that recognize and bind to specific nucleotide sequences in the promoter and enhancers, it has become evident that the activities of promoter and also enhancers are mediated by the binding of these factors.

Of such factors, for example, studies on proteins which interact with TATA box, CAAT box, or GC box for the stimulation of transcription of RNA polymerase have been undertaken. Cloning of cDNA of CAAT-binding and GC-binding factors has already been accomplished.

Furthermore, factors that bind to enhancers have been investigated and cDNA clones of several transcription factors have been successfully isolated: for example, octamer transcription factor-2 (OTF-2), a B-cell lineage specific factor, that binds to immunoglobulin κ-chain gene enhancer [Michael et al.: Nature, 336, 544-551, (1988)] and OTF-1, which recognizes the same nucleotide sequence and was found ubiquitously in many tissues [R. A. Sturm, G. Das and W. Herr: Gene & Development, 2, 1852 (1988)].

In human α-fetoprotein gene, presence of an enhancer at 3.5 kb on the upstream of transcription start point has been confirmed. A factor that binds to this enhancer has been discovered and named AFP-1 [Sawadaishi et al.: Molecular and Cellular Biology, 8, 5179-5187 (1988)].

However, the structure and physiological properties of AFP-1, and on the gene which codes for AFP-1 has not been elucidated.

OUTLINE OF THE PRESENT INVENTION

The inventors have been investigating a nuclear factor that specifically interacts with a region, characterized by the TTAATAATTA (see ID NO:3) structure that exists in the enhancer of α-fetoprotein gene, and isolated a cDNA that encodes this factor. They determined the nucleotide sequence of the cDNA and deduced the amino acid sequence of the factor and accomplished the present invention.

Therefore, this invention offers a DNA, expressed by nucleotide sequence, that codes for a protein which binds to the enhancer of human α-fetoprotein gene.

The nuclear factor which pertains to this invention and which specifically binds to the enhancer of α-fetroprotein gene, has the amino acid sequence given in FIG. 1, in its primary structure and present in cell nuclei, and can be extracted from α-fetroprotein-producing cells. Thus, it is possible to isolate the DNA defined by the nucleotide sequence given in FIG. 2, using cloning procedures starting with messenger RNA that is isolated from α-fetoprotein-producing cells derived from the liver.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows the amino acid sequence (see ID NO:2) of the protein, which binds to the α-fetoprotein gene enhancer, encoded by the DNA (referred to as λ2cDNA) of this invention.

FIG. 2 shows the nucleotide sequence (see ID NO:1) of said DNA.

FIG. 3 describes briefly the procedure for obtaining cDNA starting with poly(A)RNA to introduce it into λgtll expression vector in the example. The small arrows indicate the location in the DNA (SEQ ID NO:4) and amino acid (SEQ ID NO:5) sequences of the lacZ gene where the cDNA is to be inserted.

FIG. 4 illustrates a restriction enzyme map of the enhancer region located on the 5' side of human α-fetoprotein gene. The region from -3.3 kb to -3.7 kb is enlarged.

Hi and Ha signify the recognition sites of restriction enzymes HindIII and HaeIII, respectively.

FIG. 5 displays the result of gel shift assay employing the extracts obtained from λ2 and λgtll lysogens in the example. The control shows the pattern that occurred when the lysogen extract was eliminated. As shown in the figure, the two shifted bands (a and b) are visible only when the extract obtained from λ2 lysogen is used.

FIG. 6, shows the result of DNase I foot print analysis using λ2 lysogen extract in the example. G+A represents a size marker prepared by the chemical cleavage of the probe DNA by the Maxam-Gilbert procedure; lane 1 signifies DNA extracted from the bands which is not retarded; lanes 2 and 3 show DNA extracted from the shifted bands (a) and (b), respectively.

[A] illustrates the pattern when 5' terminus at SMaI site is labeled (the indicated sequence in FIG. 6A is SEQ ID NO:6) labeled; [B] shows the pattern when 5' [B] shows the pattern when 5' terminus at HincII site is labeled; and [C] reveals the combination of the above two results (the upper sequence in FIG. 6C is SEQ ID NO:7; the lower sequence is SEQ ID NO:8).

FIG. 7 indicates the nucleotide sequence (SEQ ID NO:1) of the cDNA cloned into λ2, obtained in the example and the amino acid sequence (SEQ ID NO:2) of the protein encoded by it.

FIG. 8 shows the construction of CAT plasmid pAFl.OE25CAT in the example, which in turn signifies that it was constructed by inserting NlaIV/XhoI fragment (31bp) of the enhancer region into the ClaI site of pAFl.OCAT.

FIG. 9 indicates the CAT assay utilizing cultured human hepatoma cells, HuH-7.

PREFERRED EMBODIMENT OF THE INVENTION

The method for isolating the target DNA fragment pertaining to this invention is described below. The experimental procedure per se can be conducted by conventional methods.

One example of α-fetoprotein-producing cells is the human hepatoma cell line, HuH-7.

In this invention, the above mentioned α-fetoprotein-producing cells are cultured and the proliferated cells are collected, after which their RNA is extracted. The guanidinium isothiocyanate-cesium chloride method [J. M. Chirgwin et al.: Biochemistry, 18, 5294, (1979)] may be used for this purpose. Poly(A)RNA is separated by the standard procedure using oligo (dT) cellulose or the like.

Next, cDNA is synthesized using reverse transcriptase with poly(A)RNA, collected above, as a template. This cDNA is then converted into double-strand cDNA, which is packaged into λphase to make a recombinant phage.

For this purpose, commercial packaging systems, such as Packer Gene Packaging System (Promega Biotec) may be used.

The recombinant phage prepared as described above is then transfected with host enterobacter, e.g. E. coli Y1090 (r⁻) and subsequently plated on agarose plates to obtain a cDNA library. The target DNA, coding for the protein that binds to the α-fetoprotein gene enhancer, can be obtained by screening the library with the DNA fragment that contains the enhancer of α-fetoprotein gene.

The invention is explained in detail by the following example.

EXAMPLE

This example shows the preparation of DNA coding for a protein that binds to the enhancer of α-fetoprotein gene (hereinafter referred to as the enhancer) from poly(A)RNA isolated from HuH-7 cells derived from human hepatoma.

1 Growth of α-fetoprotein producing cells

Human hepatoma cell line, HuH-7, which produce α-fetoprotein was cultured under the following conditions. The cell line is available from Professor Jiro Sato, Pathology Division, Cancer Institute of Okayama University, Japan.

Conditions for culture

The medium used was RPMI-1640 medium added with 3% (w/w) lactalbumin hydrolysate (Gibco) or RPMI-1640 medium containing 5-10% (v/v) fetal calf serum. The medium was replaced as needed, and incubation was performed in an incubator (37° C.) filled with air containing 5% carbon dioxide.

2 Preparation of poly(A)RNA from the cells described above

Total RNA was extracted from 2×10⁸ cells, adopting guanidinium isothiocyanate-cesium chloride method [Biochemistry, 18, 5294 (1979)], by the following procedure.

To the cells was added 20 ml solution composed of six M guanidine isothiocyanate, five mM sodium citrate, 0.1 M 2-mercaptoethanol and 0.5% sodium N-lauroyl sarcosinate, and the cells were homogenized at room temperature, then four g of cesium chloride was dissolved per 10 ml of this homogenate. In polyallomer centrifugal tubes were poured 2.5 ml solution composed of 5.7 M cesium chloride and 0.1 M EDTA (pH 7.5), over which 10 ml of the homogenate was layered. The layered mixture was then centrifuged at 34,000 rpm for 18 hours at 20° C. with Hitachi Ultracentrifuge Rotor RPS 40T (Hitachi Ltd.). The resultant sediment was dissolved in one ml solution composed of 10 mM Tris-HCl (pH 7.4), 5 mM of EDTA and 1% SDS. To this solution, an equal volume of a mixture of chloroform and n-butanol (4:1, v/v) was added, mixed well and centrifuged (16,000 g, 10 min.). To the obtained aqueous phase, one-tenth volume of 3 M sodium acetate (pH 5.5) and 2.5 volumes of ethanol were added and mixed well. The mixture was allowed to stand at -70° C. for more than two hours and centrifuged (16,000 g, 20 min.) to precipitate RNA. The precipitates were washed with 70% ethanol and then dried.

For the preparation of poly(A)RNA from the total RNA, affinity chromatography using oligo (dT) cellulose was employed as described below. Oligo (dT) cellulose (50 mg) was packed in a small column and equilibrated with a solution composed of 10 mM Tris-HCl (pH 7.5), 0.5 M NaCl, one mM of EDTA and 0.1% SDS. Then 390 μg of the total RNA dissolved in the same buffer was applied on the column. RNA that was not retained was washed out by the buffer, and the retained RNA was eluted with a solution composed of 10 mM Tris-HCl (pH 7.5), one mM EDTA and 0.05% SDS. To this eluate, 1/10 volume of 3 M sodium acetate (pH 5.5) and 2.5 volumes of ethanol were added, mixed, and the mixture was allowed to stand for more than two hours at -70° C.. The mixture was then centrifuged (12,000 g, 15 min.). The precipitates formed were washed with 70% ethanol, dried and dissolved in sterilized distilled water.

3 Synthesis of cDNA

A cDNA synthesizing kit (Pharmacia) was used for this purpose.

To the first-strand synthesis reaction mixture, 2.5 μg of poly(A)RNA and 0.4 μg of random primer, dp(N)₆ (Takara Shuzo Co., Ltd.) was added and then allowed to react according to the supplier's protocol. An EcoRI adapter was ligated to both ends of double-strand cDNA as directed by the protocol. From 2.5 μg poly(A)RNA, 2.3 μg of double-stranded cDNA was obtained.

4 Preparation of recombinant DNA and recombinant phage

Protoclone λgtll system (Promega) was used for the preparation of recombinant DNA using the double strand DNA describes above according to the protocol. The recombinant DNA thus obtained was transduced into recombinant phage using Packer, Gene Packaging System (Promega), and then transduced into E. coli Y1090 (r ). This recombinant phage was cultured on agarose plates to make a λgtll cDNA library. The complexity of this library was 3×10⁶.

The processes of preparing cDNA and λgtll recombinant phage are shown in FIG. 3.

5 Preparation of probes and the method for the screening

The above λgtll cDNA library was screened according to the known procedure [Cell, 52, 415, (1988)]. The probes for the screening were prepared from the DNA fragment of the human α-fetoprotein gene enhancer located upstream of the gene (see FIG. 4).

The DNA fragment having enhancer activity was isolated by digestion with restriction enzymes HgiAI and BstNI from the genomic DNA which had been extracted from HuH-7 cells and cloned into a λ phage vector, and its termini were filled with DNA polymerase I Klenow fragment. pUC18 plasmid was also treated with BamHI and the end was similarly treated. The two fragments were ligated using T4DNA ligase (Takara Shuzo Co., Ltd.) to give recombinant plasmid pAFE (HgiAI/BstNI)1.

The pAFE (HgiAI/BstNI) was subsequently digested using restriction enzymes XbaI and PstI, and deleted with exonuclease III and mung-bean nuclease (Takara Shuzo). Then the ends were filled in with DNA polymerase I Klenow fragment and the plasmid was recircularized by insertion of XhoI linker to the ends.

A clone which underwent deletion down to nine bases downstream from TTAATAAT sequence was screened from the deletion mutants obtained by the transduction of the plasmid into E. coli DH5 and designated as pAFE(HgiAI/XhoI)1. The pAFE(HgiAI/XhoI)1 was then hydrolyzed with restriction enzymes NlaIV and XhoI, and, a fragment containing enhancer was isolated by agarose gel electrophoresis. Ends of the (NlaIV/XhoI fragment was filled in with DNA polymerase I Klenow fragment, after which the fragments were self-ligated using T4 DNA ligase, and then the catenated DNA was cloned into HincII digested pUC18 plasmid using T4 ligase. E. coli DH5 α (BRL) was transformed with the obtained recombinant plasmid and an appropriate clone was screened from the recombinants on the basis of the number of NlaIV/XhoI fragment. Six NlaIV/XhoI fragments were catenated in the plasmid isolated from a clone at restriction site of HincII. This plasmid, named pAFE(NlaIV/XhoI)6, was cleaved with restriction enzymes SmaI and HincII (Takara Shuzo). Then the SmaI/HincII fragment containing the enhancer was isolated, labeled with T4 polynucleotide kinase and 5'[γ³² P] ATP and used as a probe for screening (referred to as the probe DNA).

According to the known method [Cell, 52, 415 (1988)], 3×10⁵ plaques from the aforementioned library, which had been obtained by plating the recombinant phage, were screened to isolate clones that specifically interact with the probe. The single positive clone was named λ2. The agarose gel containing λ2 plaque was cut out of the plate and was suspended in a solution composed of 50 mM Tris-HCl (pH 7.5), 0.1 M NaCl, 8.1 mM MgSO₄ and 0.1% of gelatin to extract the phage. A drop of chloroform was added to the suspension and kept at 4° C..

6 Preparation of lysogen extracts

The phage prepared above was transduced into E. coli Y1089 according to a known method [DNA Cloning, Vol. 1, p. 49; Ed. by D. M. Grover, IRL Press (1985)] to prepare lysogens, from which an extract was prepared following the method described by Singh [Cell, 52, 415 (1988)]. As a control, λgtll phage having no cDNA was also transduced into E. coli Y1089, from which an extract was prepared by the similar method.

7 Gel Shift Assay

Gel shift assay was performed by a known method [Nature, 319, 154 (1988)] using the above mentioned lysogen extracts and DNA probe prepared from pAFE(HgiAI/BstNI) described in 5 as follows.

pAFE(HgiAI/BstNI)1 was eleaved with SamI and HincII to give an enhancer DNA fragment. A probe was prepared by labeling the SmaI/HincII fragment with 32p at the 5'-ends.

As shown in FIG. 5, two shifted bands (a and b) were detected in the extract made from λ2 lysogen, while such bands were undetectable by λgtll lysogen extract having no cDNA. The results indicated that protein encoded by the cDNA cloned in λ2 has a property of binding to the probe DNA.

8 Identification of protein-binding site on the DNA fragment by footprint analysis

To a 75 μl of reaction solution containing 10 mM of Tris-HCl (pH 7.5), 50 mM NaCl, one mM DTT, 0.5 mM of EDTA, 2.5 mM of MgCl₂, three μg poly(dI-dC)·poly(dI-dC) and 5% glycerol, λ2 lysogen extract containing 16 μg protein (as total protein) and SmaI/HincII probe, prepared from pAFE (HgiAI/BstNI)1 described in 5, was mixed and allowed to stand at room temperature for 30 minutes. The probe was labeled with ³² P on only one of the 5' termini. To this mixture, one μl of DNase I (40 ng/μl) was added and caused to react for one minute at room temperature, after which 2.3 μl of 0.5 M EDTA was added to cease the reaction. The reaction mixture was applied to a 5% polyacrylamide gel for gel shift assay, and was subjected to electrophoresis at 11 V/cm until bromophenol blue migrated close to the lower end of the gel.

After removing one glass plate, the gel was wrapped with Saran Wrap (Asahi Chemical Ind. Co., Ltd.), against which an X-ray film was closely placed to make an autoradiogram.

Portions of gel that correspond to the position of DNA bound to protein and that of unbound DNA, determined by the relative mobility of the bands on the autoradiogram, were cut out of the gel. The gel sections were plated separately in 0.5 ml solution composed of 0.5 M ammonium acetate, 0.1% of SDS and one mM of EDTA, and allowed to stand overnight at room temperature to extract the DNA. The mixture was centrifuged (12,000 g, 10 min.) to obtain the supernatant. The gel sections were washed twice with 0.5 ml each of the same elution buffer, after which the washings were mixed with the supernatant (1.3 ml). To this mixture, five μg of yeast transfer RNA (Sigma) was added and the resultant mixture was extracted with a mixture of phenol and chloroform (1:1). To the aqueous phase thus obtained, two volumes of ethanol was added and then allowed to stand for 30 minutes or longer at -70° C. The DNA was then collected by centrifugation (12,000 g, 10 min.), washed once with 70% ethanol and was dried under vacuum. This DNA was dissolved in 10 μl solution composed of 80% formamide, 10 mM of NaOH, one mM of EDTA, 0.1% xylenecyanol and 0.1% bromophenol blue, heated at 90° C. for three minutes, after which 3-4 μl of the solution was applied to an 8% polyacrylamide gel containing seven M urea, which is commonly used to determine nucleotide sequences. The samples were subjected to electrophoresis, together with size markers, which had been prepared from the probe DNA by chemically cleaving at guanine and adenine residues [Method in Enzymology, 65, 499 (1980)].

Results of autoradiography performed after electrophoresis are shown in FIG. 6. As shown in the figure, it was indicated that, in DNA bound to the protein encoded by μ2 cDNA, the 15 base pairs containing TTAATAATTA (SEQ ID NO:3) in the middle were protected from cleavage by DNaseI, contrary to the DNA not bound to the protein.

9 Nucleotide sequence of cDNA cloned into λ2 phage

DNA of λ2 phage was prepared according to the conventional method [Molecular Cloning, a Laboratory Manual, p. 76 (1982)]. After hydrolysis with EcoRI, an insert DNA fragment was separated by an agarose gel electrophoresis.

FIG. 7 shows the nucleotide sequence of the DNA, determined by dideoxy method [Method in Enzymology, 65, 560 (1980)] and Maxam-Gilbert method [Method in Enzymology, 65, 499 (1989)], and the amino acids sequence of the protein encoded by the DNA.

The fact that the enhancer activity resides on the DNA region to where λ2 cDNA encoded protein binds was confirmed by the following CAT assay.

10 CAT assay

The NlaVI/XhoI fragment isolated from pAFE (HgiAI/XhoI)1, described in 5, was inserted into the ClaI site of pAFEl.OCAT, a vector for CAT assay, by blunt-end ligation to obtain pAFl.OE25CAT (see FIG. 8).

The pAFl.OCAT contains one kb DNA of the α-fetoprotein gene promoter region, structural gene for chloramphenicol acetyltransferase of E. coli, and SV40 poly(A) addition signal and t-antigen intron. The vector is used to screen DNA fragment having enhancer activity [J. Biol. Chem., 262, 4812 (1987)]. Transfection of CAT plasmids into HuH-7 cells and CAT assay were performed as follows, based on the known method [Mol. Cell Biol., 2, 1044 (1982)].

About 8×10⁶ cells were grown on a 75 cm² culture flask. To these cells, 20 to 30 μg CAT plasmid DNA was transfected by calcium phosphate method. The cells were incubated for two days in RPMI-1640 culture medium containing 3% lactalbumin hydrolysate, after which the cells were washed with PBS, and scraped off using policeman. Then the cells were collected by low-speed centrifugation (800 rpm, 5 min.), washed once with a solution composed of 40 mM Tris-HCl (pH 7.5), one mM EDTA and 150 mM NaCl, and suspended in 100 μl of 250 mM Tris-HCl (pH 7.5). This suspension was repeatedly frozen and thawed five times to lyse the cells, and centrifuged in an Eppendorf centrifuge at 12,000 rpm for five minutes to obtain a supernatant, which is then heat-treated at 65° C. for 10 minutes. The treated supernatant was again centrifuged at 12,000 rpm for five minutes with the same machine to obtain a supernatant which was used as a cell extract to analyze chloramphenicol acetyltransferase activity.

After determining the protein concentration of the cell extract using the protein assay kit (Biolad), 50-200 μg of protein was used for the following CAT assay.

A typical CAT assay reaction mixture, 180 μl, contains 250 mM Tris-HCl (pH 7.5), 0.1 μCi[¹⁴ C] chloramphenicol (Amersham), 0.4 mM acetyl-CoA, and 50-200 μg protein extracted from the cells. This reaction mixture was incubated at 37° C. for 60-180 minutes, after which one ml of ethyl acetate was added to extract chloramphenicol and its acetylated derivatives. The ethyl acetate layer was then separated and dried in vacuo. The residue was redissolved in 20 μl of ethyl acetate, a portion or all of which was applied onto a silica gel plate (Merck). The resultant plate was then developed using chloroform:methanol (96:4, v/v), after which the silica gel plate was dried at room temperature, Saran-wrapped, and subjected to autoradiography. As illustrated in FIG. 9, a greater CAT activity was observed when the experiment was performed with CAT plasmid, pAFl.OE25CAT, which contained the DNA of the enhancer region in the upstream of α-fetroprotein gene, than that of pAFl.OCAT, indicating the presence of an enhancer activity in NlaIV/XhoI fragment.

Industrial Applicability

This DNA is useful in the efficient production of physiologically active proteins in cultured cells, by transfecting (a) an expression vector which produce the enhancer binding protein and (b) an expression vector which contains the enhancer and human α-fetroprotein gene promoter to promote transcription of genes encoding physiologically active proteins.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 8                                                   (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 1091 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (iii) HYPOTHETICAL: NO                                                         (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: Homo sapiens                                                     (H) CELL LINE: HuH-7 (human hepatoma)                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        CATGTCCTCAGTTAATCTAAACTTTGACCAAACTAAGCTGGACAACGATGACTGTTCCTC60                 TGTCAACACAGCAATCACAGATACCACAACTGGAGACGAGGGCAACGCAGATAACGACAG 120               TGCAACGGGAATAGCAACTGAAACCAAATCCTCTTCTGCACCCAACGAAGGGTTGACCAA180                AGCGGCCATGATGGCAATGTCTGAGTATGAAGATCGGTTGTCATCTGGTCTGGTCAGCCC240                GGCCCCGAGCTTTTATAGCAAGGAATATGACAATGAAGG TACAGTGGACTACAGTGAAAC300               CTCAAGCCTTGCAGATCCCTGCTCCCCGAGTCCTGGTGCGAGTGGATCTGCAGGCAAATC360                TGGTGACAGCGGGGATCGGCCTGGGCAGAAACGTTTTCGCACTCAAATGACCAATCTGCA420                GCTGAAGGTCCTCAAG TCATGCTTTAATGACTACAGGACACCCACTATGCTAGAATGTGA480               GGTCCTGGGCAATGACATTGGACTGCCAAAGAGAGTCGTTCAGGTCTGGTTCCAGAATGC540                CCGGGCAAAAGAAAAGAAGTCCAAGTTAAGCATGGCCAAGCATTTTGGTATAAACCAAAC 600               GAGTTATGAGGGACCCAAAACAGAGTGCACTTTGTGTGGCATCAAGTACAGCGCTCGGCT660                GTCTGTACGTGACCATATCTTTTCCCAACAGCATATCTCCAAAGTTAAAGACACCATTGG720                AAGCCAGCTGGACAAGGAGAAAGAATACTTTGACCCAGC CACCGTACGTCAGTTGATGGC780               TCAACAAGAGTTGGACCGGATTAAAAAGGCCAACGAGGTCCTTGGACTGGCAGCTCAGCA840                GCAAGGGATGTTTGACAACACCCCTCTTCAGGCCCTTAACCTTCCTACAGCATATCCAGC900                GCTCCAGGGCATTCCT CCTGTGTTGCTCCCGGGCCTCAACAGCCCCTCCTTGCCAGGCTT960               TACTCCATCCAACACAGCTTTAACGTCTCCTAAGCCGAACTTGATGGGTCTGCCCAGCAC1020               AACTGTTCCTTCCCCTGGCCTCCCCACTTCTGGATTACCAAATAAACCGTCCTCAGCGTC 1080              GCTGAGCTCCC1091                                                                (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 363 amino acids                                                    (B) TYPE: amino acid                                                           (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (vi ) ORIGINAL SOURCE:                                                         (A) ORGANISM: Homo sapiens                                                     (H) CELL LINE: HuH-7 (human hepatoma)                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        MetSerSerValAsnLeuAsnPheAspGlnThrLysLeuAspAsnAsp                               151015                                                                          AspCysSerSerValAsnThrAlaIleThrAspThrThrThrGlyAsp                              202530                                                                         GluGlyAsnAlaAspAsnAspSerAlaThrGlyIleAlaThrGluThr                                354045                                                                        LysSerSerSerAlaProAsnGluGlyLeuThrLysAlaAlaMetMet                               505560                                                                         AlaMetSerGlu TyrGluAspArgLeuSerSerGlyLeuValSerPro                              65707580                                                                       AlaProSerPheTyrSerLysGluTyrAspAsnGluGlyThrValAsp                                859095                                                                        TyrSerGluThrSerSerLeuAlaAspProCysSerProSerProGly                               100105110                                                                      AlaSe rGlySerAlaGlyLysSerGlyAspSerGlyAspArgProGly                              115120125                                                                      GlnLysArgPheArgThrGlnMetThrAsnLeuGlnLeuLysValLeu                               130 135140                                                                     LysSerCysPheAsnAspTyrArgThrProThrMetLeuGluCysGlu                               145150155160                                                                   ValLeuGly AsnAspIleGlyLeuProLysArgValValGlnValTrp                              165170175                                                                      PheGlnAsnAlaArgAlaLysGluLysLysSerLysLeuSerMetAla                                180185190                                                                     LysHisPheGlyIleAsnGlnThrSerTyrGluGlyProLysThrGlu                               195200205                                                                      CysThrLeu CysGlyIleLysTyrSerAlaArgLeuSerValArgAsp                              210215220                                                                      HisIlePheSerGlnGlnHisIleSerLysValLysAspThrIleGly                               225 230235240                                                                  SerGlnLeuAspLysGluLysGluTyrPheAspProAlaThrValArg                               245250255                                                                      GlnLeuM etAlaGlnGlnGluLeuAspArgIleLysLysAlaAsnGlu                              260265270                                                                      ValLeuGlyLeuAlaAlaGlnGlnGlnGlyMetPheAspAsnThrPro                                275280285                                                                     LeuGlnAlaLeuAsnLeuProThrAlaTyrProAlaLeuGlnGlyIle                               290295300                                                                      ProProValLeuLeuPr oGlyLeuAsnSerProSerLeuProGlyPhe                              305310315320                                                                   ThrProSerAsnThrAlaLeuThrSerProLysProAsnLeuMetGly                                325330335                                                                     LeuProSerThrThrValProSerProGlyLeuProThrSerGlyLeu                               340345350                                                                      ProAsnLys ProSerSerAlaSerLeuSerSer                                             355360                                                                         (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 10 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (iii) HYPOTHETICAL: NO                                                          (vi) ORIGINAL SOURCE:                                                         (A) ORGANISM: Homo sapiens                                                     (H) CELL LINE: HuH-7 (human hepatoma)                                          (ix) FEATURE:                                                                  (A) NAME/KEY: enhancer                                                         (B) LOCATION: 1..10                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        TTAATAATTA10                                                                   (2) INFORMATION FOR SEQ ID NO:4:                                                (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 12 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: Bacteriophage lambda                                             (B) STRAIN: lambda gt11                                                        (viii) POSITION IN GENOME:                                                     (A) CHROMOSOME/SEGMENT: lacZ gene EcoRI site                                   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                        GCGG AATTCCAG12                                                                (2) INFORMATION FOR SEQ ID NO:5:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 4 amino acids                                                      (B) TYPE: amino acid                                                           (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: peptide                                                    (vi) ORIGINAL SOURCE:                                                           (A) ORGANISM: Bacteriophage lambda                                            (B) STRAIN: lambda gt11                                                        (viii) POSITION IN GENOME:                                                     (A) CHROMOSOME/SEGMENT: lacZ gene EcoRI site                                   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                        AlaGluPheGln                                                                   (2) INFORMATION FOR SEQ ID NO:6:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 15 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       ( D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: cDNA to mRNA                                               (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: Homo sapiens                                                     (H) CELL LINE: HuH-7 (human hepatoma)                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                        TGATTAATAATTACA15                                                              (2) INFORMATION FOR SEQ ID NO:7:                                               (i) SEQUENCE CHARACTERISTICS:                                                  ( A) LENGTH: 32 base pairs                                                     (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA to mRNA                                               (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: Homo sapiens                                                     (H) CELL LINE: HuH-7 (human hepatoma)                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                        AGGGAGCCTGATTAATAATTACACTAAGTCAA 32                                            (2) INFORMATION FOR SEQ ID NO:8:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 32 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: double                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: cDNA                                                       (vi) ORIGINAL SOURCE:                                                          (A) ORGANISM: Homo sapiens                                                     (H) CELL LINE: HuH-7 (human hepatoma)                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                        TTGA CTTAGTGTAATTATTAATCAGGCTCCCT32                                        

We claim:
 1. An isolated DNA sequence for coding for a protein having the amino acid sequence SEQ ID NO:2.
 2. A DNA of claim 1, wherein said protein-coding DNA comprises the sequence SEQ ID NO:1.
 3. An isolated protein having the amino acid sequence SEQ ID NO:2. 